## Readme 
This dataset contains information on the semantic similarity of patent-standard pairs at the document and chapter level for three major standard-setting organizations.


# Version:
1.0 (1 August 2022)


# Authors:  
Fabian Gaessler, Max Planck Institute for Innovation and Competition, Munich, fabian.gaessler@ip.mpg.de 
Dietmar Harhoff, Max Planck Institute for Innovation and Competition, Munich, dietmar.harhoff@ip.mpg.de
Lorenz Brachtendorf, Max Planck Institute for Innovation and Competition, Munich


# Database description:
A detailed description of the database can be found in the following file (Section 4 and Appendix D):
Brachtendorf Gaessler Harhoff 2020 EPO ARP Approximating Standard Essentiality.pdf

An entity-relationship diagram of the database can be found in the following file:
ERD_stddb.pdf


# Files:
Due to their size, all files are provided in compressed *.7z format. Before use, extract them to recover the original tab-delimited *.csv files.
Note that the *_std_doc_text and *_std_ch_text files are not included to avoid copyright infringement. Interested scholars may reach out to the authors for further information.
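
Once an archive has been extracted, the resulting file can be read like any tab-delimited text file. The following is a minimal sketch in Python using only the standard library. The function name `read_tab_delimited`, the UTF-8 encoding, and the presence of a header row are assumptions made for illustration and are not confirmed by the database description; adjust them to the actual files.

```python
import csv
from pathlib import Path


def read_tab_delimited(path, max_rows=None):
    """Read an extracted tab-delimited *.csv file into a list of dicts.

    Assumptions (not confirmed by the dataset documentation):
    the file is UTF-8 encoded and its first line holds the column names.
    """
    rows = []
    with Path(path).open(newline="", encoding="utf-8") as handle:
        # delimiter="\t" because the files are tab-delimited despite the .csv extension
        reader = csv.DictReader(handle, delimiter="\t")
        for i, row in enumerate(reader):
            # Stop early when only a preview of a large file is wanted
            if max_rows is not None and i >= max_rows:
                break
            rows.append(row)
    return rows
```

For example, `read_tab_delimited("example_table.csv", max_rows=5)` would return the first five rows of a file, where `example_table.csv` is a placeholder name and not an actual file in this dataset. Given the size of the files, reading a limited number of rows first is a cheap way to inspect the columns before loading everything.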


# Cite data:
Please cite the following report when using the data:
Brachtendorf, Lorenz, Fabian Gaessler, and Dietmar Harhoff. Approximating the Standard Essentiality of Patents – A Semantics-Based Analysis. Final report for the EPO Academic Research Programme, June 12, 2020.